18 research outputs found

    Natural Language Processing and E-Government: Extracting Reusable Crime Report Information

    Get PDF
    Crime reporting needs to be possible 24/7. Although 911 and tip-lines are the most publicized reporting mechanisms, several other options exist, ranging from in-person reporting to online submissions. Internet-based crime reporting systems allow victims and witnesses of crime to report incidents to police 24/7 from any location. However, these existing e-mail and text-based systems provide little support for witnesses\u27 memory recall leading to reports with less information and lower accuracy. These systems also do not facilitate reuse and integration of the reported information with other information systems. We are developing an anonymous Online Crime Reporting System that is designed to extract relevant crime information from witness\u27 narratives and to ask additional questions based on that information. We leverage natural language processing and investigative interviewing techniques to support memory recall and map the information directly to a database to support information reuse. We report on the evaluation of the Suspect Description Module (SDM) of the system. Our interface captures 70% (recall) of information from witness narratives with 100% precision. Additional modules will follow the design and development methods used with this module

    Reporting On-Campus Crime Online: User Intention to Use

    Get PDF
    National surveys demonstrate that millions of crimes go unreported in the United States. Several reasons may contribute to this lack of reporting and we are investigating these potential reasons and how they may be addressed. We are developing an online system that provides an anonymous and secure mechanism for both victims and witnesses to report crimes to police. The system is being implemented and tested on a university campus. Potential users (i.e., students, staff) were surveyed to determine their intent to use the system. Respondents claimed to report crimes already, which is in contrast with the findings from the national surveys. Our respondents found the online system useful, accessible, and safe to report crime, but the type of crime and the urgency of response is a determinant in the decision to use the system versus reporting it to a live person

    Data Mining Techniques to Study Therapy Success with Autistic Children

    Get PDF
    Autism spectrum disorder has become one of the most prevalent developmental disorders, characterized by a wide variety of symptoms. Many children need extensive therapy for years to improve their behavior and facilitate integration in society. However, few systematic evaluations are done on a large scale that can provide insights into how, where, and how therapy has an impact. We describe how data mining techniques can be used to provide insights into behavioral therapy as well as its effect on participants. To this end, we are developing a digital library of coded video segments that contains data on appropriate and inappropriate behavior of autistic children in different social settings during different stages of therapy and. In general, we found that therapy increased appropriate behavior and decreased inappropriate behavior. Decision trees and association rules provided more detailed insights for high and low levels of appropriate and inappropriate behavior. We found that a child\u27s interaction with a parent or therapist led to especially high levels of appropriate behavior and behavior is most predictable while therapy is in progress

    Crime Information Extraction from Police and Witness Narrative Reports

    Get PDF
    To solve crimes, investigators often rely on interviews with witnesses, victims, or criminals themselves. The interviews are transcribed and the pertinent data is contained in narrative form. To solve one crime, investigators may need to interview multiple people and then analyze the narrative reports. There are several difficulties with this process: interviewing people is time consuming, the interviews - sometimes conducted by multiple officers - need to be combined, and the resulting information may still be incomplete. For example, victims or witnesses are often too scared or embarrassed to report or prefer to remain anonymous. We are developing an online reporting system that combines natural language processing with insights from the cognitive interview approach to obtain more information from witnesses and victims. We report here on information extraction from police and witness narratives. We achieved high precision, 94% and 96%, and recall, 85% and 90%, for both narrative types

    A Classifier to Evaluate Language Specificity in Medical Documents

    Get PDF
    Consumer health information written by health care professionals is often inaccessible to the consumers it is written for. Traditional readability formulas examine syntactic features like sentence length and number of syllables, ignoring the target audience\u27s grasp of the words themselves. The use of specialized vocabulary disrupts the understanding of patients with low reading skills, causing a decrease in comprehension. A naive Bayes classifier for three levels of increasing medical terminology specificity (consumer/patient, novice health learner, medical professional) was created with a lexicon generated from a representative medical corpus. Ninety-six percent accuracy in classification was attained. The classifier was then applied to existing consumer health web pages. We found that only 4% of pages were classified at a layperson level, regardless of the Flesch reading ease scores, while the remaining pages were at the level of medical professionals. This indicates that consumer health web pages are not using appropriate language for their target audience

    Non-verbal Communication with Autistic Children Using Digital Libraries

    Get PDF
    Autism spectrum disorder (ASD) has become one of the most prevalent mental disorders over the last few years and its prevalence is still growing. The disorder is characterized by a wide variety of symptoms such as lack of social behavior, extreme withdrawal, and problems communicating. Because of the diversity in symptoms and the wide variety in severity for those, each autistic child has different needs and requires individualized therapy. This leads to long waiting lists for therapy

    Programs for machine learning

    No full text
    learning algorithms on word sense disambiguation with small dataset

    Using Symbolic Knowledge in the UMLS to Disambiguate Words in Small Datasets with a Naïve Bayes Classifier.

    No full text
    Current approaches to word sense disambiguation use and combine various machine-learning techniques. Most refer to characteristics of the ambiguous word and surrounding words and are based on hundreds of examples. Unfortunately, developing large training sets is time-consuming. We investigate the use of symbolic knowledge to augment machine-learning techniques for small datasets. UMLS semantic types assigned to concepts found in the sentence and relationships between these semantic types form the knowledge base. A naïve Bayes classifier was trained for 15 words with 100 examples for each. The most frequent sense of a word served as the baseline. The effect of increasingly accurate symbolic knowledge was evaluated in eight experimental conditions. Performance was measured by accuracy based on 10-fold cross-validation. The best condition used only the semantic types of the words in the sentence. Accuracy was then on average 10 % higher than the baseline; however, it varied from 8 % deterioration to 29 % improvement. In a follow-up evaluation, we noted a trend that the best disambiguation was found for words that were the least troublesome to the human evaluators. Keywords: Artificial intelligence, machine learning, naïve Bayes, wor
    corecore